Corpus: fra_news_2010_10K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 95 98 98 98 99
1000 869 967 994 996 998
10000 6306 8639 9675 9926 9978
100000 6307 8640 9676 9927 9979
1000000 6307 8640 9676 9927 9979


Zipf's diagram for sentence endings


Gnuplot diagram

977 msec needed at 2018-03-03 08:44